RAW SEQUENCE LISTING 



ERROR REPORT 



} 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: /o/n¥9 f 7*ffi 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703- 308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://www.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that thei 
a possibility that the disk/CD-Rom may have been affected by treatment given to all incoming mail. 
Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rom. 
Any reply including a sequence listing in electronic form should NOT be sent to the 2023 1 zip code address for the 
United States Patent and Trademark Office, and instead should be sent via the following to die indicated addresses: 

1. EFS-Bio (<http;//www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7 th Floor, Examiner Name, 
Sequence Information, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 



U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
20 1 1 South Clark Place, Arlington, VA 22202 
4. Federal Express, United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequerfce, Room 1B03-Mailroom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 



Source: 
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:e Lb ting Error Summary 



ERROR DETECTED SUCCESTEP CORRECTION SERIAL NUMBER; J 

ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFTWARE 



_Wnppe4Hvddc* 
Wrapped Amino* , 




_PatentIn 2.0 
"bug" 



Skipped Sequence! 

(OLD RULES) 



9_jj 



Skipped Sequence! 

(NEW RULES) 



.Use of n'f or Xu'i 
(NEW RULES) 



10 I nvalid <313> 



_PatentIn 2.0 
"bug" 



The number/text at the end of each lino "wrapped" down to the i*xt line. This may occur if your file 
was retrieved in a word processor after creating it. Please adjust your right margin to J; this win 
prevent "wrapping." 

The rules require that a line not exceed 72 characters in length. This includes white spaces. 

The numbering under each J* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCn(DOS) text, as required by the Sequence Rules. -Please" 
ensure your subsequent submission b saved In ASCII (ext. 

Sequencers) contain n's or Xaa's representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section-that some may be missing. 

A "bug" in Palentln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequences^) . Normally, Palentln would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino arid sequence... This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences." 

Sequence^) missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NOX: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(») SEQUENCE DESCR1PTION.SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 

Please-also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 

Sequence^) missing. If Intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 
. Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present 
In <220> to <223> section, please explain location of n or Xaa; and which residue n or Xaa represents. 

Per 1.823 of Sequence SaMcs; the only valid <2 13> response* are: Unknown, Artificial Sequence, or 
scientific name (Oemu/»pecie j). <220>»<223> section is required when <2 1 3> response is Unknown or 
is Artificial Sequence ■ ■ ■ t 

Sequence^) missing the <220> "Feature" and associated numeric identifiers and responses. 

•7. V*e of <220> to <223> is MANDATORY if<2 13> "Organism" response i* "Artificial Sequence" or 
"Unknown." Pleaso explain source of genetic material in <220> to <223> section, 
(See "Federal Register ,"06701/1998, VoL 63, No. 104, pp. 2963 1-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Palentln version 2.0. This cause* a corrupted file, 
resulting in mining mandatory numeric identifier* and responses (as indicated on raw sequence . . 

. listing). Instead, please use "File Manager" cr any other manual means to copy file to floppy dialc. ^ 



n can only bo used to represent a singlo nucleotide in a nucleic acid sequence. N is not used to represent 
any value not specifically a nucleotide. 
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PATENT APPLICATION: US/10/04 9,988 
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TIME: 
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Does Hot Comply 
Corrected Diskette Needec 



3 <110> APPLICANT: Biosyn Arzneimittel GmbH 

5 <120> TITLE OF INVENTION: Nucleic acid molecule comprising a nucleic acid sequence 

6 which codes for a haemocyanin, and comprising at least one 

7 intron sequence 

9 <130> FILE REFERENCE: PCT1220-01966 

11 <140> CURRENT APPLICATION NUMBER: US/10/049,988 

12 <141> CURRENT FILING DATE: 2002-02-20 

14 <160> NUMBER OF SEQ ID NOS : 108 

16 <170> SOFTWARE: Patentln Ver . 2.1 



ERRORED SEQUENCES 

412 <210> SEQ ID NO: 15 

413 <211> LENGTH: 1546 

414 <212> TYPE: DNA 

415 <213> ORGANISM: Haliotis tuberculata 

417 <400> SEQUENCE: 15 

418 agcccacaga ggaccagttg aagaaacaga agtcactcgc caacatactg acggcaatgc 60 

419 acactttcat cgtaaggaag ttgattcgct gtccctggat gaagcaaaca acttgaagaa 120 

420 tgccctttac aagctacaga acgaccacag tctaacggga tacgaagcaa tctctggtta 180 

421 ccatggatac cccaatctgt gtccggaaga aggcgatgac aaaatacccc tgctgcgtcc 240 

422 ccggatgggc atctttcctt actggcacag actcttgacc attcaactgg aaagagctct 300 

423 tgagcacaat ggtgcactgc ttggtgttcc ttactgggac tggaacaagg acctgtcgtc 360 

424 actgccggcg ttcttctccg actccagcaa caacaatccc tacttcaagt accacatcgc 4 20 

425 cggtgttggt cacgacaccg tcagagagcc aactagtctt atatataacc agccccaaat 4 80 

426 ccatggttat gattatctct attacctagc attgaccacg cttgaagaaa acaattactg 540 

427 ggactttgag gttcagtatg agatcctcca caacgccgtc cactcctggc ttggaggatc 600 

428 ccagaagtat tccatgtcta ccctggagta ttcggccttt gaccctgtct ttatgatcct 660 

429 tcactcgggt ctagacagac tttggatcat ctggcaagaa cttcagaaga tcaggagaaa 720 

430 gccctacaac ttcgctaaat gtgcttatca tatgatggaa gagccactgg cgcccttcag 780 

431 ctatccatct atcaaccagg acgagttcac ccgtgccaac tccaagcctt ctacagtttt 840 

432 tgacagccat aagttcggct accattacga taacctgaat gttagaggtc acagcatcca 900 

433 agaactcaac acaatcatca atgacttgag aaacacagac agaatctacg caggatttgt 960 

434 tttgtcaggc atcggtacgt ctgctagtgt caagatctat ctccgaacag atgacaatga 1020 

435 cgaagaagtt ggaactttca ctgtcctggg aggagagagg gaaatgccat gggcctacga 1080 

436 gcgagttttc aagtatgaca tcacagaggt tgcagataga cttaaaatta agttatgggg 1140 

437 acacccttta acttccggaa ctggagatca catccttacg aatggaatcg gtggtaaaca 1200 

438 agagcctacc ca aatccttt catcatctac agacctqcca atcatgacta cgatgttctt 1260 > 
E--> 439 gttatcccag ta^gaagaa accttcacat ccctcccaaa gttgtcgtca agaaaggcac 1320 / ^/X^~ 

440 ccgcatcgag ttccacccag tcgatgattc agttacgaga ccagttgttg atcttggaag 1380 j L jin>m ^ CTY\ 

441 ctacactgca ctcttcaact gtgtggtacc accgttcaca taccacggat tcgaactgaa 1440 ^ 

442 ccacgtctat tctgtcaagc ctggtgacta ctatgttact ggacccacga gagacctttg 1500 < [ / \_^/\J 
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443 ccagaatgca gatgtcagga ttcatatcca tgttgaggat gagtaa 154 



1855 


<210> SI 


3Q ID NO 


: 39 






















1856 


<211> LENGTH: 515 












7 
I J 










1857 


<212> TYPE : 


PRT 






















1858 


<213> ORGANISM: 


Haliotis tuberculata I 










1860 


<400> SEQUENCE: 


39 




















1861 


Gly 


Ala 


His 


Arg 


Gly 


Pro 


Val 


Glu 


Glu 


Thr 


Glu 


Val 


Thr 


Arg 


Gin His 


1862 


1 








5 










10 










15 


1864 


Thr 


Asp 


Gly 


Asn 


Ala 


His 


Phe 


His 


Arg 


Lys 


Glu 


Val 


Asp 


Ser 


Leu Ser 


1865 








20 










25 










30 




1867 


Leu 


Asp 


Glu 


Ala 


Asn 


Asn 


Leu 


Lys 




Ala 


Leu 


Tyr 


Lys 


Leu 


Gin Asn 


1868 






35 










40 










45 






1870 


Asp 


His 


Ser 


Leu 


Thr 


Gly 


Tyr 


Glu 


Ala 


He 


Ser 


Gly 


Tyr His Gly Tyr 


1871 




50 










55 










60 








1873 


Pro 




Leu 


Cys 


Pro 


Glu 


Glu 


Gly 


Asp 


Asd 


Lys 


He 


Pro 


Leu 


Leu Arg 


1874 


65 










70 










75 








80 


1876 


Pro 


Arg 


Met 


Gly 


He 


Phe 


Pro 


Tvr 


Trp 


His 


Arg 


Leu 


Leu 


Thr 


He Gin 


1877 










85 










90 










95 


1879 


Leu 


Glu 


A 


Ala 




Glu 


His 


Asn 


Gly 


Ala 


Leu 


Leu 


Gly Val 


Pro Tyr 


1880 








100 










105 










110 




1883 


Trp 


Asp 


Trp 




Lvs 


Asd 


Leu 


Ser 




Leu 


Pro 


Ala 


Phe 


Phe 


Ser Asp 


1884 






115 










120 










125 






1886 


Ser 


Ser 




Asn 


Asn 


Pro 


Tyr 




Lvs 


Tvr 


His 


He 


Ala 


Gly Val Gly 


1887 




130 










135 










140 








1889 


His 


Asp 


Thr 


Val 


Arg 


Glu 




Thr 


Ser 


Leu 


He 


Tyr 


Asn 


Gin 


Pro Gin 


1890 


145 










150 










155 








160 


1892 


lie 


His 


Gly 


Tvr 


Asp 


Tyr 


Leu 




T 


Leu 


Ala 


Leu 


Thr 


Thr 


Leu Glu 


1893 










165 










170 










175 


1895 


Glu 


Asn 


Asn 


Tyr 


Trp 


Asd 


Phe 


Glu 


Val 




Tyr 


Glu 


He 


Leu 


His Asn 


1896 








180 










185 










190 




1898 


Ala 


Val 


His 


Ser 




Leu 


Glv 


Gly 




Gin 


Lys 


Tyr 


Ser 


Met 


Ser Thr 


1899 






195 










200 










205 






1901 


Leu 


Glu 


Tyr 


Ser 


Ala 


Phe 


Asp 


Pro 


Val 


Phe 


Met 


He 


Leu 


His 


Ser Gly 


1902 




210 










215 










220 








1904 


Leu 


Asp 


Arg 


Leu 


Trp 


He 


He 


Trp 


Gin 


Glu 


Leu 


Gin 


Lys 


He 


Arg Arg 


1905 


225 










230 










235 








240 


1907 


Lys 


Pro 


Tyr 


Asn 


Phe 


Ala 


Lys 


Cys 


Ala 


Tyr 


His 


Met 


Met 


Glu 


Glu Pro 


1908 










245 










250 










255 


1910 


Leu 


Ala 


Pro 


Phe 


Ser 


Tyr 


Pro 


Ser 


He 


Asn 


Gin 


Asp 


Glu 


Phe Thr Arg 


1911 








260 










265 










270 




1913 


Ala 


Asn 


Ser 


Lys 


Pro 


Ser 


Thr 


Val 


Phe 


Asp 


Ser 


His 


Lys Phe Gly Tyr 


1914 






275 










280 










285 






1916 


His 


Tyr 


Asp 


Asn 


Leu 


Asn 


Val 


Arg 


Gly 


His 


Ser 


He 


Gin 


Glu 


Leu Asn 


1917 




290 










295 










300 








1919 


Thr 


He 


He 


Asn 


Asp 


Leu 


Arg 


Asn 


Thr 


Asp 


Arg 


He 


Tyr Ala Gly Phe 


1920 


305 










310 










315 








320 


1922 


Val 


Leu 


Ser 


Gly 


He 


Gly 


Thr 


Ser 


Ala 


Ser 


Val 


Lys 


He 


Tyr 


Leu Arg 


1923 










325 










330 










335 


1925 


Thr 


Asp 


Asp 


Asn 


Asp 


Glu 


Glu 


Val 


Gly 


Thr 


Phe 


Thr 


Val 


Leu Gly Gly 
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1926 








340 










345 




350 






1928 


Glu 


Arg 


Glu 


Met 


Pro 


Trp 


Ala 


Tyr 


Glu 


Arg Val Phe 


Lys Tyr 


Asp 


He 


1929 






355 










360 






365 






1931 


Thr 


Glu 


Val 


Ala 


Asp 


Arg 


Leu 


Lys 


He 


Lys Leu Trp 


Gly His 


Pro 


Leu 


1932 




370 










375 






380 








1934 


Thr 


Ser 


Gly 


Thr 


Gly 


Asp 


His 


He 


Leu 


Thr Asn Gly 


He Gly 


Gly 


Lys 


1935 


385 










390 








395 






400 


1937 


Gin 


Glu 


Pro 


Thr 


Gin 


He 


Leu 


Ser 


Ser 


Ser Thr Asp 


Leu Pro 


He 


Met 


1938 










405 










U10 




415 




1940 


Thr 


Thr 


Met 


Phe 


Leu 


Leu 


Ser 


Gln| 


,Xaa 


Ely Arg Asn 


Leu His 


He 


Pro 


1941 




























1943 


Pro 


Lys 


Val 


Val 


Val 


Lys 


Lys 


Gly 


Thr 


Arg He Glu 


Phe His 


Pro 


Val 


1944 






435 










440 






445 






1946 


Asp 


Asp 


Ser 


Val 


Thr 


Arg 


Pro 


Val 


Val 


Asp Leu Gly 


Ser Tyr 


Thr 


Ala 


1947 




450 










455 






460 








1949 


Leu 


Phe 


Asn 


Cys 


Val 


Val 


Pro 


Pro 


Phe 


Thr Tyr His 


Gly Phe 


Glu 


Leu 


1950 


465 










470 








475 






480 


1952 


Asn 


His 


val 


Tyr 


Ser 


Val 


Lys 


Pro Gly Asp Tyr Tyr 


Val Thr 


Gly 


Pro 


1953 










485 










490 




495 




1955 


Thr 


Arg 


Asp 


Leu 


Cys 


Gin 


Asn 


Ala 


Asp 


Val Arg He 


His He 


His 


Val 


1956 








500 










505 




510 






1958 


Glu 


Asp 


Glu 























1959 515 

2852 <210> SEQ ID NO: 62 

2853 <211> LENGTH: 1185 

2854 <212> TYPE: DNA 

2855 <213> ORGANISM: Haliotis tuberculata 

2857 <400> SEQUENCE: 62 

2858 atcatattgc tggcagtgga gtcaggaaag acgtgacgtc tcttaccgca tctgagatag 60 * 

2859 agaacctgag gcatgctctg caaagcgtga tggatgatga t ggacccaat ggattccagg 120 j Q 

28 60 caattgctgc ttatcacgga agtcctccca tgtgtcacat gcmtgatggt agagacgttg 180 / /sU*" / 

2861 catgttgtac tcatggaatg gcatctttcc ctcactggca cagactgttt gtgaaacaga 240 

2862 tggaggatgc actggctgcg catggagctc acattggcat accatactgg gattggacaa 300 

2863 gtgcgtttag tcatctgcct gccctagtga ctgaccacga gcacaatccc ttccaccacg 360 

2864 gacatattgc tcatcggaat gtggatacat ctcgatctcc gagagacatg ctgttcaatg 420 

2865 accccgaaca cgggtcagaa tcattcttct atagacaggt tctcttggct ctagaacaga 480 

2866 cagacttctg ccaatttgaa gttcagtttg aaataacaca caatgcaatc cactcttgga 540 

2867 ctggaggaca tactccatat ggaatgtcat cactggaata tacagcatat gatccactct 600 

2868 tttatctcca ccattccaac actgatcgta tctgggccat ctggcaggca ctccagaaat 660 

2869 acagaggttt tcaatacaac gcagctcatt gcgatatcca ggttctgaaa caacctctta 720 

2870 aaccattcag cgagtccagg aatccaaacc cagtcaccag agccaattct agggcagtcg 780 

2871 attcatttga ttatgagaga ctcaattatc aatatgacac acttaccttc cacggacatt 840 

2872 ctatctcaga acttgatgcc atgcttcaag agagaaagaa ggaagagaga acatttgcag 900 

2873 ccttcctgtt gcacggattt ggcgccagtg ctgatgtttc gtttgatgtc tgcacacctg 960 

2874 atggtcattg tgcctttgct ggaaccttcg cggtacttgg tggggagctt gagatgccct 1020 

2875 ggtcctttga aagattgttc cgttacgata tcacaaaggt tctcaagcag atgaatcttc 1080 

2876 actatgattc tgagttccac tttgagttga agattgttgg cacagatgga acagaactgc 1140 

2877 catcggatcg tatcaagagc cctaccattg aacaccatgg aggag 1185 
4137 <210> SEQ ID NO: 79 
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4138 


<211> LENGTH: 395 
























4139 


<212> TYPE: 


PRT 




























<213> ORGANISM: 


Megathura crenulata 
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<400> SEQUENCE: 
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His 


lie 


Ala 


Gly 


Ser Gly Val Arg Lys Asp Val 


Thr 


Ser 


Leu 


Thr 
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10 










15 




4146 


Ala 


Ser 


Glu 


lie 


Glu 


Asn 


Leu 


Arg 


His 


Ala 


Leu 


Gin 
n 


Ser 


Val 


Met 


Asp 


4147 








20 










25 










30 






4149 


sp 


sp 




Pro 


sn 


Gly Phe Gin Ala 


He Ala 


Ala 


Tyr His Gly Ser 


4150 






35 










\ 40 










45 










ro 


Pro 


Met 


Cys 


is 








Gly 


— 

Arg 


Asp 


— 


Ala 


Cys 


Cys 


— — — 
r 


4153 




50 




















60 










4155 


His 


Y 


Met 


Ala 


er 




Pro 


His 


irp 


is 


Arg 


Leu 
eu 


Phe 


Val 


ys 


Gin 














vn 




















fin 


4158 


Met 


Glu 


sp 


Ala 


Leu 


A a 


A a 


is 






is 




Y 


e 




Tyr 


4159 










85 










on 










95 




4161 


rp 


sp 


Trp 


Thr 


Ser 


A a 


P e 


er 




Leu 


Pro 


Ala 
a 


Leu 


y a 


r 


Asp 










100 


























4164 


is 


U 




sn 


ro 


P e 


HIS 


. 




is 


I e 


a 


. 


Arg 


Asn 


a 


4165 






IS 




























4167 


sp 


Thr 


er 
er 


Arg 


er 


Pro 




Asp 


Met 


Leu 


P e 


s " 


Asp 


Pro 


Glu 


His 




































4170 




er 


U 


er 


Ph 


P J~ 


Tyr 


Arg 


Gin 


Va 




eu 


A a 


Leu 


G u 




4171 


145 
































4173 


r 


sp 


Ph 


Cys 




Phe 


G u 


Val 


In 




G u 


I e 


T r 


His 




A a 


4174 


































4176 


6 


is 


er 




r 


G y 


G y 


His 


Thr 


Pro 


Tyr 


y 


Met 


Ser 


Ser 


Leu 


4177 








fin 










185 










190 






4179 


U 


Tyr 






Tyr 


Asp 


Pro 


Leu 


Phe 


Tyr 


Leu 


is 


His 


Ser 


Asn 


Thr 


















200 










205 








4182 


sp 


Arg 


e 


Trp 


A a 


He 


Trp 


Gin 


Ala 


Leu 


Gin 


Lys 


Tyr 


Arg 


Gly 


Phe 
















215 










220 










4185 


Gin 


Tyr 


Asn 


Ala 




His 


Cys 


Asp 


He 


Gin 


Val 


Leu 


Lys 


Gin 


Pro 


Leu 


4186 


225 










230 










235 










240 


4188 


Lys 


Pro 


Phe 


Ser 


Glu 


Ser 


Arg 


Asn 


Pro 


Asn 


Pro 


Val 


Thr 


Arg 


Ala 


Asn 


4189 










245 










250 










255 




4191 


Ser 


Arg 


Ala 


Val 


Asp 


Ser 


Phe 


Asp 


Tyr 


Glu 


Arg 


Leu 


Asn 


Tyr 


Gin 


Tyr 


4192 








260 










265 










270 






4194 


Asp 


Thr 


Leu 


Thr 


Phe 


His 


Gly 


His 


Ser 


He 


Ser 


Glu 


Leu 


Asp 


Ala 


Met 


4195 






275 










280 










285 








4197 


Leu 


Gin 


Glu 


Arg 


Lys 


Lys 


Glu 


Glu 


Arg 


Thr 


Phe 


Ala 


Ala 


Phe 


Leu 


Leu 


4198 




290 










295 










300 










4200 


His 


Gly 


Phe 


Gly 


Ala 


Ser 


Ala 


Asp 


Val 


Ser 


Phe 


Asp 


Val 


Cys 


Thr 


Pro 


4201 


305 










310 










315 










320 


4203 


Asp 


Gly 


His 


Cys 


Ala 


Phe 


Ala 


Gly 


Thr 


Phe 


Ala 


Val 


Leu 


Gly 


Gly 


Glu 


4204 










325 










330 










335 




4206 


Leu 


Glu 


Met 


Pro 


Trp 


Ser 


Phe 


Glu 


Arg 


Leu 


Phe 


Arg 


Tyr 


Asp 


lie 


Thr 


4207 








340 










345 










350 






4209 


Lys 


Val 


Leu 


Lys 


Gin 


Met 


Asn 


Leu 


His 


Tyr 


Asp 


Ser 


Glu 


Phe 


His 


Phe 
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RAW SEQUENCE LISTING^ 

PATENT APPLICATION: " US/10/049, 988 



DATE: 03/05/2002 
TIME: 14:00:57 



Input Set : A:\seqlist-l.txt 

Output Set: N:\CEF3\03052002\J049988.raw 



4210 
4212 
4213 
4215 
4216 
4637 
4638 
4639 
4640 
4642 
4643 
4644 
4645 
4646 
4647 
4648 
4649 
4650 
4651 
4652 
4653 
4654 
4655 
4656 
4657 
4658 
4659 
4660 
4661 
4662 
4663 

> 4664 
4665 
4666 
4667 
4668 
4961 
4962 
4963 
4964 
4966 
4967 
4968 

• 4969 
4970 
4971 
4972 
4973 
4974 



355 360 
Glu Leu Lys He Val Gly Thr Asp Gly Thr Glu 

370 375 
He Lys Ser Pro Thr He Glu His His Gly Gly 
385 390 395 

<210> SEQ ID NO: 95 
<211> LENGTH: 1548 
<212> TYPE: DNA 

<213> ORGANISM: Haliotis tuberculata 
<400> SEQUENCE: 95 

ggagcccaca gaggaccagt tgaagaaaca gaagtcactc 
gcacactttc atcgtaagga agttgattcg ctgtccctgg 
aatgcccttt acaagctaca gaacgaccac agtctaacgg 
taccatggat accccaatct gtgtccggaa gaaggcgatg 
ccccggatgg gcatctttcc ttactggcac agactcttga 
cttgagcaca atggtgcact gcttggtgtt ccttactggg 
tcactgccgg cgttcttctc cgactccagc aacaacaatc 
gccggtgttg gtcacgacac cgtcagagag ccaactagtc 
atccatggtt atgattatct ctattaccta gcattgacca 
tgggactttg aggttcagta tgagatcctc cacaacgccg 
tcccagaagt attccatgtc taccctggag tattcggcct 
cttcactcgg gtctagacag actttggatc atctggcaag 
aagccctaca acttcgctaa atgtgcttat catatgatgg 
agctatccat ctatcaacca ggacgagttc acccgtgcca 
tttgacagcc ataagttcgg ctaccattac gataacctga 
caagaactca acacaatcat caatgacttg agaaacacag 
gttttgtcag gcatcggtac gtctgctagt gtcaagatct 
gacgaagaag ttggaacttt cactgtcctg ggaggagaga 
gagcgagttt tcaagtatga catcacagag gttgcagata 
ggacaccctt taacttccgg aactggagat cacatcctta 
caagagccta ccca aatcct ttcatc atct .acagac.ctg.c.. 



365 

Leu Pro Ser Asp Arg 



gccaacatac 
atgaagcaaa 
gatacgaagc 
acaaaatacc 
ccattcaact 
actggaacaa 
cctacttcaa 
ttatatataa 
cgcttgaaga 
tccactcctg 
ttgaccctgt 
aacttcagaa 
aagagccact 
actccaagcc 
atgttagagg 
acagaatcta 
atctccgaac 
gggaaatgcc 
gacttaaaat 
cgaatggaat 



ttgttatccc agta(n£gaag aaaccttcac atccctccca 

acccgcatcg agttccaccc agtcgatgat tcagttacga 
agctacactg cactcttcaa ctgtgtggta ccaccgttca 
aaccacgtct attctgtcaa gcctggtgac tactatgtta 
tgccagaatg cagatgtcag gattcatatc catgttgagg 
<210> SEQ ID NO: 107 
<211> LENGTH: 1185 
<212> TYPE': DNA 

<213> ORGANISM: Megathura crenulata 
<400> SEQUENCE: 107 

gatcatattg ctggcagtgg agtcaggaaa gacgtgacgt 
gagaacctga ggcatgctct gcaaagcgtg atggatgatg 
gcaattgctg cttatcacgg aagtcctccc atgtgtcaca 
gcatgttgta ctcatggaat ggcatctttc cctcactggc 
atggaggatg cactggctgc gcatggagct cacattggca 
agtgcgttta gtcatctgcc tgccctagtg actgaccacg 
ggacatattg ctcatcggaa tgtggataca tctcgatctc 
gaccccgaac acgggtcaga atcattcttc tatagacagg 



aagttgtcgt 

gaccagttgt 
cataccacgg 
ctggacccac 
atgagtaa 



ctcttaccgc 
at ggacccaa 
tgc^gatgg 



tgacggcaat 60 
caacttgaag 120 
aatctctggt 180 
cctgctgcgt 240 
ggaaagagct 300 
ggacctgtcg 360 
gtaccacatc 420 
ccagccccaa 480 
aaacaattac 540 
gcttggagga 600 
ctttatgatc 660 
gatcaggaga 720 
ggcgcccttc 780 
ttctacagtt 840 
tcacagcatc 900 
cgcaggattt 960 
agatgacaat 1020 
atgggcctac 1080 
taagttatgg 1140 
cggtggtaaa 1200 * 

Jcangsrgt.t.r. 1260 ^ i Q 

caagaaaggc 1320 / 
tgatcttgga 1380 
attcgaactg 14 4 0 
gagagacctt 1500 
1548 



atctgagata 60 
tggattccag 120 



acagactgtt 
taccatactg 
agcacaatcc 
cgagagacat 
ttctcttggc 



tagagacgtt 180 

tgtgaaacag 240 
ggattggaca 300 
cttccaccac 360 
gctgttcaat 4 20 
tctagaacag 480 
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RAW SEQUENCE LISTING ^ DATE: 03/05/2002 

PATENT APPLICATION: * US/10/049, 988 TIME: 14:00:58 



Input Set : A:\seqlist-l.txt 

Output Set: N:\CRF3\03052002\J049988.raw 

4975 acagacttct gccaatttga agttcagttt gaaataacac acaatgcaat ccactcttgg 540 

4 976 actggaggac atactccata tggaatgtca tcactggaat atacagcata tgatccactc 600 

4 977 ttttatctcc accattccaa cactgatcgt atctgggcca tctggcaggc actccagaaa 660 

4978 tacagaggtt ttcaatacaa cgcagctcat tgcgatatcc aggttctgaa acaacctctt 720 

4979 aaaccattca gcgagtccag gaatccaaac ccagtcacca gagccaattc tagggcagtc 780 

4980 gattcatttg attatgagag actcaattat caatatgaca cacttacctt ccacggacat 840 

4981 tctatctcag aacttgatgc catgcttcaa gagagaaaga aggaagagag aacatttgca 900 
4 982 gccttcctgt tgcacggatt tggcgccagt gctgatgttt cgtttgatgt ctgcacacct 960 
4 983 gatggtcatt gtgcctttgc tggaaccttc gcggtacttg gtggggagct tgagatgccc 1020 

4984 tggtcctttg aaagattgtt ccgttacgat atcacaaagg ttctcaagca gatgaatctt 1080 

4985 cactatgatt ctgagttcca ctttgagttg aagattgttg gcacagatgg aacagaactg 1140 

4986 ccatcggatc gtatcaagag ccctaccatt gaacaccatg gagga 1185 

4988 <210> SEQ ID NO: 108 

4989 <211> LENGTH: 309 
4 990 <212> TYPE: DNA 

4 991 <213> ORGANISM: Megathura crenulata 
4993 <400> SEQUENCE: 108 

4 994 ggtcacgatc acagtgaacg tcacgatgga tttttcagga aggaagtcgg ttccctgtcc 60 

4995 ctggatgaag ccaatgacct taaaaatgca ctgtacaagc tgcagaatga tcagggtccc 120 

4996 aatggatatg aatcaatagc cggttaccat ggctatccat tcctctgccc tgaacatggt 180 

4997 gaagaccagt acgcatgctg tgtccacgga atgcctgtat ttccacattg gcacagactt 240 

4998 catacaatcc agtttgagag agctctcaaa gaacatggtt ctcatttggg tctgccatac 300 

4999 tgggactgg 309 
5005/" 



0 



W&Pt$n and/or XiW has been detected in the Sequence Listing. 
/ Review the Sequence Listing to insure a corresponding 
^ y explanation is presented in the <220> to <223> fields of 
each sequence using n or Xaa. 
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# 

VERIFICATION SUMMARY DATE: 03/05/2002 

PATENT APPLICATION: * US/10/049, 988 TIME: 14:00:59 

Input Set : A:\seqlist-l.txt 

Output Set: N:\CRF3\03052002\J049988.raw 

L:ll M-.270 C: Current Application Number differs, Replaced Application Number 
L:12 M:271 C: Current Filing Date differs, Replaced Current Filing Date 



L 


439 M:340 E: 


(46) 


"n" 


or " 


Xaa" 


used: 


Feature required, 


for SEQ ID# : 


15 


L 


1940 


M:340 E 


(46) 


H n i 


or 


"Xaa 


used 


Feature required 


for 


SEQ 


ID# 


:39 


L 


2860 


M:340 E 


(46) 


n n i 


or 


"Xaa 


used 


Feature required 


for 


SEQ 


ID# 


:62 


L 


4152 


M:340 E 


(46) 


"n' 


or 


"Xaa 


used 


Feature required 


for 


SEQ 


ID# 


:79 


L 


4664 


M:340 E 


(46) 


"n" 


or 


"Xaa 


used 


Feature required 


for 


SEQ 


ID# 


:95 


L 


4969 


M:340 E 


(46) 


" n ' 


or 


"Xaa 


used 


Feature required 


for 


SEQ 


ID# 


:107 


L 


5005 


M:254 E 


No. 


of Bases 


conflict, 


LENGTH: Input: 8 Counted 


309 


SEQ 


:108 
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